Jan Skoglund

Binamix -- A Python Library for Generating Binaural Audio Datasets

May 02, 2025, via arXiv

Perceptual Audio Coding: A 40-Year Historical Perspective

Apr 22, 2025, via arXiv

SCOREQ: Speech Quality Assessment with Contrastive Regression

Oct 09, 2024, via arXiv

Neural Speech and Audio Coding

Aug 13, 2024, via arXiv

NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment

Sep 28, 2023, via arXiv

LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models

Mar 23, 2023, via arXiv

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset

Sep 14, 2022, via arXiv

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

Jul 05, 2022, via arXiv

A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality

Apr 05, 2022, via arXiv

SoundStream: An End-to-End Neural Audio Codec

Jul 07, 2021, via arXiv